Search results for "data mining"

Showing 10 of 34 documents

Rings for Privacy: an Architecture for Large Scale Privacy-Preserving Data Mining

2021

This article proposes a new architecture for privacy-preserving data mining based on Multi-Party Computation (MPC) and secure sums. Whereas traditional MPC approaches rely on a small number of aggregation peers that replace a centralized trusted entity, the current study puts forth a distributed solution that involves all data sources in the aggregation process, with the help of a single server for storing intermediate results. A large-scale scenario is examined, and the possibility that data become inaccessible during the aggregation process is considered, a possibility that traditional schemes often neglect. Here, it is explicitly examined, as it might be provoked by intermittent network connec…

020203 distributed computing | Information privacy | Distributed databases | Distributed database | Settore ING-INF/03 - Telecomunicazioni | Computer science | Reliability (computer networking) | Secure Multi-Party Computation | 02 engineering and technology | computer.software_genre | Secret sharing | Data Mining; Data privacy; Distributed databases; Peer-to-peer computing; Secret sharing; Secure Multi-Party Computation | Computational Theory and Mathematics | Hardware and Architecture | Server | Signal Processing | 0202 electrical engineering electronic engineering information engineering | Secure multi-party computation | Data Mining | Data mining | Peer-to-peer computing | C-means data mining Privacy secret sharing secure multi-party computation | Secret sharing | computer | Data privacy

researchProduct
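
The secure-sum idea mentioned in the abstract above can be illustrated with textbook additive secret sharing. The sketch below is a generic illustration only, not the architecture proposed in the article: the function names, the modulus, and the all-to-all share exchange are assumptions made for the example, and the single server is reduced to "whoever adds up the published partial sums".

```python
import secrets

# Assumed modulus for the example; any value larger than the true sum works.
MODULUS = 2**61 - 1


def make_shares(value, n_parties, modulus=MODULUS):
    """Split `value` into n_parties random shares that sum to value mod modulus."""
    shares = [secrets.randbelow(modulus) for _ in range(n_parties - 1)]
    last = (value - sum(shares)) % modulus
    return shares + [last]


def secure_sum(private_values, modulus=MODULUS):
    """Hypothetical helper: sum private values without revealing any single one.

    Each data source splits its value into one share per participant.
    Participant j only ever sees column j of the share matrix, which is
    uniformly random on its own, and publishes the sum of that column.
    """
    n = len(private_values)
    all_shares = [make_shares(v, n, modulus) for v in private_values]
    partial_sums = [
        sum(all_shares[i][j] for i in range(n)) % modulus for j in range(n)
    ]
    # Adding the published partial sums recovers the total and nothing else.
    return sum(partial_sums) % modulus


if __name__ == "__main__":
    print(secure_sum([3, 5, 10]))
```

This sketch assumes every participant stays reachable until all shares are exchanged; it does not handle the data-inaccessibility case that the abstract says the article examines.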

Fragments of peer review: A quantitative analysis of the literature (1969-2015)

2018

This paper examines research on peer review between 1969 and 2015 by looking at records indexed in the Scopus database. Although it is often argued that peer review has been poorly investigated, we found that the number of publications in this field doubled from 2005. Half of this work was indexed as research articles, a third as editorial notes and literature reviews, and the rest were book chapters or letters. We identified the most prolific and influential scholars, the most cited publications and the most important journals in the field. Co-authorship network analysis showed that research on peer review is fragmented, with the largest group of co-authors including only 2.1% of the wh…

0301 basic medicine | Science and Technology Workforce | Research Quality Assessment | lcsh:Medicine | Careers in Research | Peer review co-authorship collaboration community | Citation analysis | Centrality | Data Mining | Sociology | lcsh:Science | Multidisciplinary | 05 social sciences | Scientometrics | co-authorship | Research Assessment | Knowledge sharing | Professions | Citation Analysis | community | Network Analysis | Research Article | Computer and Information Sciences | Science Policy | Abstracting and Indexing | Peer Review | Abstracting and Indexing as Topic ; Animals ; Data Mining ; Databases Bibliographic ; History 20th Century ; History 21st Century ; Humans ; Peer Review | Scopus | Library science | 050905 science studies | Research and Analysis Methods | History 21st Century | 03 medical and health sciences | Animals | Humans | Scientific Publishing | lcsh:R | Scientometrics | History 20th Century | Databases Bibliographic | collaboration | 030104 developmental biology | Quantitative analysis (finance) | People and Places | Scientists | lcsh:Q | Population Groupings | 0509 other social sciences | Scientific publishing | Centrality

researchProduct
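
The fragmentation measure mentioned in the abstract above, the share of authors that fall in the largest group of co-authors, can be sketched as a connected-components computation. This is a minimal illustration under assumptions: the input format (one author list per publication) and the function name are hypothetical, and the study's actual data pipeline is not described in the excerpt.

```python
from collections import Counter


def largest_group_share(papers):
    """Return the fraction of authors in the largest co-authorship component.

    `papers` is an iterable of author lists (an assumed input format).
    Two authors are linked when they share at least one paper.
    """
    parent = {}

    def find(author):
        # Union-find lookup with path halving.
        parent.setdefault(author, author)
        while parent[author] != author:
            parent[author] = parent[parent[author]]
            author = parent[author]
        return author

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    for authors in papers:
        authors = list(authors)
        for author in authors:
            find(author)  # register single-author papers too
        for other in authors[1:]:
            union(authors[0], other)

    if not parent:
        return 0.0
    sizes = Counter(find(author) for author in list(parent))
    return max(sizes.values()) / len(parent)


if __name__ == "__main__":
    sample = [["A", "B"], ["B", "C"], ["D"], ["E", "F"]]
    print(largest_group_share(sample))
```

A low value means that even the biggest collaborating cluster covers only a small part of the author population, which is what "fragmented" refers to here.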

Unlock ways to share data on peer review

2020

Peer review is the defining feature of scholarly communication. In a 2018 survey of more than 11,000 researchers, 98% said that they considered peer review important or extremely important for ensuring the quality and integrity of scholarly communication.

0303 health sciences | Multidisciplinary | business.industry | 05 social sciences | data mining | Public relations | 050905 science studies | Research management | Bibliometrics ; Scientometrics ; Research Integrity | 03 medical and health sciences | Work (electrical) | Publishing | peer review data mining | peer review | Sociology | 0509 other social sciences | business | 030304 developmental biology

researchProduct

Reverse-safe data structures for text indexing

2021

We introduce the notion of reverse-safe data structures. These are data structures that prevent the reconstruction of the data they encode (i.e., they cannot be easily reversed). A data structure D is called z-reverse-safe when there exist at least z datasets with the same set of answers as the ones stored by D. The main challenge is to ensure that D stores as many answers to useful queries as possible, is constructed efficiently, and has size close to the size of the original dataset it encodes. Given a text of length n and an integer z, we propose an algorithm which constructs a z-reverse-safe data structure that has size O(n) and answers pattern matching queries of length at most d optim…

050101 languages & linguistics | Computer science | data structure | 02 engineering and technology | privacy | Set (abstract data type) | combinatoric | 0202 electrical engineering electronic engineering information engineering | 0501 psychology and cognitive sciences | Pattern matching | Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni | algorithm | Settore INF/01 - Informatica | 05 social sciences | Search engine indexing | INF/01 - INFORMATICA | data mining | Data structure | Matrix multiplication | combinatorics | Exponent | 020201 artificial intelligence & image processing | data structure; algorithm; combinatorics; de Bruijn graph; data mining; privacy | Algorithm | Adversary model | de Bruijn graph | Integer (computer science)

researchProduct

Sensor Mining for User Behavior Profiling in Intelligent Environments

2011

The proposed system exploits sensor mining methodologies to profile user behavior patterns in an intelligent workplace. The work is based on the assumption that users’ habit profiles are implicitly described by sensory data, which explicitly show the consequences of users’ actions on the environment state. Sensor data are analyzed in order to infer relationships of interest between environmental variables and the user, thereby detecting behavior profiles. The system is designed for a workplace equipped in the context of Sensor9k, a project carried out at the Department of Computer Science of Palermo University.

Ambient intelligence | Exploit | Ambient Intelligence | Computer science | Sensor node | Profiling (information science) | Sensor Data Mining | Data mining | computer.software_genre | computer

researchProduct

Medical news aggregation and ranking of taking into account the user needs

2019

The purpose of this work is to develop an intelligent information system for aggregating and ranking news according to the needs of the user. The online mass media market, the needs of readers, the purposes of their searches, and the moments when searching is not enough to find the news are analyzed. A conceptual model of the news aggregation and ranking system is constructed to present the operation of the future intelligent information system and to show its structure. The methods and means for implementing the intelligent information system are selected. An online resource for aggregation and ranking of news, news feeds and flexible setting…

Bayesian clustering Bayesian networks Content analysis Content ranking Context filtering Data mining Intelligent system Medical news News aggregation User needs | CEUR Workshop Proceedings

researchProduct

Blended Learning als Spielfeld für Learning Analytics und Educational Data Mining

2020

The use of digital learning formats in blended learning thus offers opportunities in at least two areas. First, digital learning formats can directly and favorably influence students' learning processes, improve their performance, and also produce positive effects on many other levels, such as motivation or self-concept. Second, digital learning formats generate an abundance of data in many forms. When working with digital tools, students generate usage data such as dwell times and activity profiles, they produce performance data from digital assignments, and they leave text contributions in forums and chats. All of these data can be used to…

Blended learning | Political science | Learning analytics | Library science | Educational data mining

researchProduct

The Urban Landscape and the Real Estate Market. Structures and Fragments of the Axiological Tessitura in a Wide Urban Area of Palermo

2016

The proposed study deals with the urban landscape of Palermo and its possible representation from the perspective of real estate market analysis. Real estate is one of the most significant types of capital asset, and the wide range of its possible uses complicates the interpretation of market phenomena. The multi-layered reality of such a large city (represented through a sample of 500 properties) needs to be articulated into a significant set of sub-markets in order to outline the complexity and to map the distribution of homogeneous groups of properties within the whole city area. The comparison between quality and price within each cluster allows us to elicit the degre…

Cluster analysis | Urban landscape Real estate market Data mining Cluster analysis Urban regeneration | Urban regeneration | Settore ICAR/22 - Estimo | Urban landscape | Data mining | Real estate market

researchProduct

The Three Steps of Clustering In The Post-Genomic Era

2013

This chapter describes the basic algorithmic components that are involved in clustering, with particular attention to the classification of microarray data.

Clustering high-dimensional data | Settore INF/01 - Informatica | business.industry | Correlation clustering | Pattern recognition | computer.software_genre | Biclustering | CURE data clustering algorithm | Clustering Classification Biological Data Mining | Consensus clustering | Artificial intelligence | Data mining | business | Cluster analysis | computer | Mathematics

researchProduct

A Web Application for Interactive Visualization of European Basketball Data

2020

The statistical analysis of basketball games is a fast-growing field. Certainly, basketball data are scientifically relevant because an appropriate analysis provides a great deal of information about the performance of both players and teams. The number of games played each season generates a large amount of data worth analyzing. Basketball analytics is well established in U.S. leagues. In Europe, however, it has not been duly developed. This study focuses on the top three European team competitions: the EuroLeague, the EuroCup, and the Spanish ACB (Association of Basketball Clubs, acronym in Spanish) league. Their official websites provide access to game data for anyone who is interested, …

Electronic Data Processing | Models Statistical | Information Systems and Management | Basketball | business.industry | Computer science | Basketball | Athletic Performance | Web Browser | Online Systems | Data science | Field (computer science) | Computer Science Applications | Europe | User-Computer Interface | Humans | Web application | Statistical analysis | Basketball games | business | Interactive visualization | Software | Big data mining | Information Systems | Big Data

researchProduct